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Clean Copy of Pending Claims 
Claims 

1 . A DNA construct comprising a transcriptional imit which comprises a 
ribonucleotide reductase gene and a thioredoxin gene or a uridine kinase gene and/or a 
dCTP deaminase gene. 

2. A DNA construct as claimed in claim 1 comprising a transcriptional 
unit which comprises a ribonucleotide reductase gene and a thioredoxin gene. 

3. A DNA construct as claimed in claim 1 comprising a transcriptional 
unit which comprises a uridine kinase gene and/or a dCTP deaminase gene. , 

4. A DNA construct as claimed in claim .1 comprising a transcriptional 
unit which comprises a uridine"^ kinase gene and a dCTP deaminase gene. 

5. A DNA construct according to claim 1 wherein the construct is an 
extrachromosomal vector. 

6. A DNA construct according to claim 5 wherein the vector is a plasmid, 
virus, transposon, minichromosome or phage. 

7. A DNA construct according to claim 6 wherein the vector is a plasmid, 
wherein the ribonucleotide reductase gene is a nrdA gene and/or a nrdB gene, and 
wherein the thioredoxin gene is a nrdC gene and said genes are arranged in an operon. 

8. A DNA construct according to claim 7 wherein the transcriptional unit 
comprises a nrdA gene and a nrdB gene. 

9. A DNA construct according to claim 8 in which the nrdA and nrdB 
genes are located downstream of the nrdC gene. 
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10. A DNA construct according to claim 8 wherein the nrdC gene is 
upstream of the nrdA gene and nrdA gene is upstream of the «rf/B gene. 

11. A DNA construct according to claim 6, further comprising a regulatory 
element. 

12. A DNA construct according to claim 11, wherein the regulatory 
element is selected from the group consisting of a promoter, an operator, a termination 
sequence, an initiation sequence and a ribosome binding site. 

13. A DNA construct according to claim 1 2 wherein the promoter is the 
lambda Pl promoter or a derivative therefrom. 

14. A DNA construct according to claim 12 wherein the termination 
sequence is a synthetic terminator. 

15. A DNA construct according to claim 7 wherein the nrdA gene is 
modified such that ribonucleotide reductase encoded by the imit is less sensitive to 
allosteric inhibition than the ribonucleotide reductase encoded by the unit comprising 
an unmodified nrdA gene. 

16. A DNA construct according to claim 15, wherein the nrdA gene is 
modified at a dTTP binding site. 

17. A DNA construct according to claim 1 5 wherein the nrdA gene is 
modified so as to encode an Ala-to-IIe change at position 79 in the nrdA expression 
product. 

18. A DNA construct according to claim 7 wherein the nrdA, nrdB and 
nrdC genes are derived from a T phage. 
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19. A DNA construct according to claim 1 8 wherein the T phage is a T 
even phage. 

20. A DNA construct according to claim 19 wherein the T even phage is a 
T4 phage. 

21 . A DNA construct according to claim 1 wherein the construct further 
comprises a thymidylate synthase gene. 

22. A DNA construct according to claim 2 1 wherein the thymidylate 
synthase gene is the td gene. 

23. A DNA construct according to claim 22 wherein td gene is located on 
the same vector as the nrdA, nrdB and nrdC genes. 

24. A DNA construct according to claim 22 wherein the td gene is located 
in the same operon as the nrdA, nrdB and nrdC genes. 

25. A DNA construct according to claim 24 wherein the td gene is located 
downstream (in terms of reading frame) from the nrdA, nrdB and nrdC genes. 

26. A DNA construct according to claim 22 wherein the td gene is derived 
from a T phage. 

27. A DNA construct according to claim 26 wherein the td gene is derived 
from a T even phage. 

28. A DNA construct according to claim 27 wherein the td gene is derived 
from T4 phage. 
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29. A DNA construct according to claim 2 further comprising a viridine 
kinase gene and/or a dCTP deaminase gene. 

30. A DNA construct according to claim 29, wherein the DNA construct 
comprises both a uridine kinase gene and a dCTP deaminase gene. 

31. A DNA construct according to claim 1 , wherein the uridine kinase 
gene is a udk gene. 

32. A DNA construct according to claim 1, wherein the dCTP deaminase 
gene is a dcd gene 

33. A modified host cell comprising a DNA construct according to any of 
the preceding claims. 

34. A modified host cell according to claim 33 wherein the host cell 
displays repressed or no uracil DNA glycosylase activity. 

35. A modified host cell according to claim 34 wherein one or more host 
cell DNA sequences encoding for uracil DNA glycosylase activity have been 
modified so as to encode expression products displaying repressed, low levels of or no 
uracil DNA glycosylase activity. 

36. A modified host cell according to claim 35 wherein the host cell DNA 
sequence is an ung gene. 

37. A modified host cell according to claim 34 wherein one or more host 
cell DNA sequences encoding for uracil DNA glycosylase activity have been 
removed. 
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38. A modified host cell according to claim 33 wherein the cell is a 
eukaryote or a prokaryote. 

39. A modified host cell according to claim 38 wherein the host cell is 
selected from the group consisting of E. coli. Salmonella, Pseudomonas, Bacillus and 
Saccharomyces. 

40. A modified host cell comprising a DNA construct, which construct 
comprises a transcription DNA unit, which imit comprises a ribonucleotide reductase 
gene and a thioredoxin gene, wherein said ribonucleotide reductase displays less 
sensitivity to allosteric inhibition than the wild-type equivalent of the reductase, and 
wherein said host cell further comprises one or more of the following features: 

(a) a transcription unit located on said DNA construct, comprising 
a thymidylate synthase gene heterologous to the thymidylate synthase gene of 
the host cell; 

(b) a transcription unit located on said DNA construct, comprising 
a uridine kinase gene; 

(c) a transcription unit located on said DNA construct, comprising 
a dCTP deaminase gene; and 

(d) repressed or absent uracil DNA glycosylase activity. 

41 . A modified host cell according to claim 40, wherein the ribonucleotide 
reductase gene is modified at a dTTP binding site. 

42. A modified host cell according to claim 41 , wherein the ribonucleotide 
reductase is the nrdA gene and the modification is an Ala-to-Ile change at position 79 
in the nrdA expression product. 

43. A modified host cell according to claim 38, wherein the DNA construct 
comprises both the uridine kinase gene and the dCTP deaminase gene. 
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44. A modified host cell according to claim 40, wherein the cell comprises 
each one of the features of (a) to (d). 

45. A process for the production of pyrimidine deoxyribonucleosides 
comprising culturing a host cell according to claim 33. 

46. A process according to claim 45 wherein the deoxyribonucleoside is 
thymidine. 

47. A process for the manufacture of azidothymidine comprising culturing 
a host cell according to claim 33. 

48. A culture medium comprising a host cell according to claim 33. 



Currently claims 1-48 are pending. Claims 5, 21, 31-32, 45, and 47-48 have been 
amended to place them in form appropriate to US practice and to reduce the filing fee by 
removing multiple dependency. Attached hereto is a marked-up version of the changes made 
to the claims by the current amendment. The attached page is captioned " Version With 
Markings To Show Changes Made **. Applicants have attached an abstract on a separate 
sheet of paper as required by US practice. Applicants have amended the specification for 
purposes of adding the priority information. 



GlaxoSmithKline Inc. 
Corporate Intellectual Property 
Five Moore Drive, P.O. Box 13398 
Research Triangle Park, NC 27709 
Phone: 919-483-2482 
Facsimile: 919-483-7988 



REMARKS 




Respectfiilly submitted, 



Trank P. Grassier 
Attorney for Applicants 
Registration No. 3 1 , 1 64 



Date: / / 9"^ 6/ 
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Abstract 

VECTORS, CELLS AND PROCESSES FOR PYRIMIDINE 
DEOXYRIBONUCLEOSIDES PRODUCTION 

Novel DNA constructs and host cells comprising the same are disclosed. 
DNA constructs comprise a transcription unit ( e.g. operon) comprising DNA 
sequences encoding for ribonucleotide reductase and thioredoxin or a uridine kinase 
gene and/or a dCTP deaminase gene. In preferred embodiments the constructs 
comprising DNA sequences encoding for ribonucleotide reductase and thioredoxin 
further comprise DNA sequences encoding for thymidylate synthase and/or 
transcription units comprising sequences encoding for uridine kinase preferably 
together with dCTP deaminase. In particularly preferred embodiments, the host cells 
comprise constructs having all of the above characteristics wherein the host cell 
displays repressed or no uracil DNA glycosylase activity. This may be achieved by 
removal of the host cell ung gene. Use of host cells in the manufacture of pyrimidine 
deoxyribonucleotides e.g. thymidine is also disclosed. 
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Claims 

1 . A DNA construct comprising a transcriptional vinit which comprises a 
ribonucleotide reductase gene and a thioredoxin gene or a uridine kinase gene and/or a 
dCTP deaminase gene. 

2. A DNA construct as claimed in claim 1 comprising a transcriptional 
imit which comprises a ribonucleotide reductase gene and a thioredoxin gene. 

3. A DNA construct as claimed in claim 1 comprising a transcriptional 
unit which comprises a uridine kinase gene and/or a dCTP deaminase gene. 

4. A DNA construct as claimed in claim 1 comprising a transcriptional 
unit which comprises a uridine kinase gene and a dCTP deaminase gene. 

5. A DNA construct according to claim 1 [any of claims 1 -4] wherein the 
construct is an extrachromosomal vector. 

6. A DNA construct according to claim 5 wherein the vector is a plasmid, 
virus, transposon, minichromosome or phage. 

7. A DNA construct according to claim 6 wherein the vector is a plasmid, 
wherein the ribonucleotide reductase gene is a nrdA gene and/or a nrdB gene, and 
wherein the thioredoxin gene is a nrdC gene and said genes are arranged in an operon. 

8. A DNA construct according to claim 7 wherein the transcriptional unit 
comprises a nrdA gene and a nrdB gene. 

9. A DNA construct according to claim 8 in which the nrdA and nrdB 
genes are located downstream of the nrdC gene. 
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10. A DNA construct according to claim 8 wherein the nrdC gene is 
upstream of the nrdA gene and nrdA gene is upstream of the nrdB gene. 

11. A DNA construct according to claim 6, further comprising a regulatory 
element. 

12. A DNA construct according to claim 11, wherein the regulatory 
element is selected from the group consisting of a promoter, an operator, a termination 
sequence, an initiation sequence and a ribosome binding site. 

13. A DNA construct according to claim 12 wherein the promoter is the 
lambda Pl promoter or a derivative therefrom. 

14. A DNA construct according to claim 12 wherein the termination 
sequence is a synthetic terminator. 

15. A DNA construct according to claim 7 wherein the nrdA gene is 
modified such that ribonucleotide reductase encoded by the unit is less sensitive to 
allosteric inhibition than the ribonucleotide reductase encoded by the unit comprising 
an urunodified nrdA gene. 

1 6. A DNA construct according to claim 1 5, wherein the nrdA gene is 
modified at a dTTP binding site. 

17. A DNA construct according to claim 1 5 wherein the nrdA gene is 
modified so as to encode an Ala-to-IIe change at position 79 in the nrdA expression 
product. 

18. A DNA construct according to claim 7 wherein the nrdA, nrdB and 
nrdC genes are derived from a T phage. 
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19. A DNA construct according to claim 1 8 wherein the T phage is a T 
even phage. 

20. A DNA construct according to claim 19 wherein the T even phage is a 
T4 phage. 

21. A DNA construct according to claim 1 [any of claims 1 -4] wherein the 
construct further comprises a thymidylate synthase gene. 

22. A DNA construct according to claim 21 wherein the thymidylate 
synthase gene is the td gene. 

23. A DNA construct according to claim 22 wherein td gene is located on 
the same vector as the nrdA, nrdB and nrdC genes. 

24. A DNA construct according to claim 22 wherein the td gene is located 
in the same operon as the nrdA, nrdB and nrdC genes. 

25. A DNA construct according to claim 24 wherein the td gene is located 
downstream (in terms of reading frame) from the nrdA, nrdB and nrdC genes. 

26. A DNA construct according to cleiim 22 wherein the ^£3? gene is derived 
from a T phage. 

27. A DNA construct according to claim 26 wherein the td gene is derived 
from a T even phage. 

28. A DNA construct according to claim 27 wherein the td gene is derived 
from T4 phage. 
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29. A DNA construct according to claim 2 further comprising a uridine 
kinase gene and/or a dCTP deaminase gene. 

30. A DNA construct according to claim 29, wherein the DNA construct 
comprises both a uridine kinase gene and a dCTP deaminase gene. 

31. A DNA construct according to claim 1 ,[3 or 4] wherein the uridine 
kinase gene is a udk gene. 

32. A DNA construct according to claim 1,[3 or 4] wherein the dCTP 
deaminase gene is a dcd gene 

33. A modified host cell comprising a DNA construct according to any of 
the preceding claims. 

34. A modified host cell according to claim 33 wherein the host cell 
displays repressed or no uracil DNA glycosylase activity. 

35. A modified host cell according to claim 34 wherein one or more host 
cell DNA sequences encoding for uracil DNA glycosylase activity have been 
modified so as to encode expression products displaying repressed, low levels of or no 
uracil DNA glycosylase activity. 

36. A modified host cell according to claim 35 wherein the host cell DNA 
sequence is an ung gene. 

37. A modified host cell according to claim 34 wherein one or more host 
cell DNA sequences encoding for uracil DNA glycosylase activity have been 
removed. 
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38. A modified host cell according to claim 33 wherein the cell is a 
eukaryote or a prokaryote. 

39. A modified host cell according to claim 38 wherein the host cell is 
selected from the group consisting of E. coli. Salmonella, Pseudomonas, Bacillus and 

Saccharomyces. 

40. A modified host cell comprising a DNA construct, which construct 
comprises a transcription DNA unit, which imit comprises a ribonucleotide reductase 
gene and a thioredoxin gene, wherein said ribonucleotide reductase displays less 
sensitivity to allosteric inhibition than the wild-type equivalent of the reductase, and 
wherein said host cell further comprises one or more of the following features: 

(a) a transcription unit located on said DNA construct, comprising 
a thymidylate synthase gene heterologous to the thymidylate synthase gene of 
the host cell; 

(b) a transcription unit located on said DNA construct, comprising 
a uridine kinase gene; 

(c) a transcription unit located on said DNA construct, comprising 
a dCTP deaminase gene; and 

(d) repressed or absent uracil DNA glycosylase activity. 

41 . A modified host cell according to claim 40, wherein the ribonucleotide 
reductase gene is modified at a dTTP binding site. 

42. A modified host cell according to claim 41, wherein the ribonucleotide 
reductase is the nrdA gene and the modification is an Ala-to-Ile change at position 79 
in the nrdA expression product. 

43. A modified host cell according to claim 38, wherein the DNA construct 
comprises both the uridine kinase gene and the dCTP deaminase gene. 
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44. A modified host cell according to claim 40, wherein the cell comprises 
each one of the features of (a) to (d). 

45. A process for the production of pyrimidine deoxyribonucleosides 
comprising culturing a host cell according to claim 33 [any of claims 33-44]. 

46. A process according to claim 45 wherein the deoxyribonucleoside is 
thymidine. 

47. A process for the manufacture of azidothymidine comprising culturing 
a host cell according to claim 33 [any of claims 33-44], 

48. A culture medium comprising a host cell according to claim 33 [any of 
claims 33-44]. 
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Vectors. Cells and Processes for 
Pvrlmidine Deoxvribonucleosides Production 

Field of the Invention 

5 The present invention relates to the production of pyrimidines, purines and derivatives 
thereof e.g. deoxyribonucleosides, using genetically modified cells comprising novel 
DNA constructs. 

Background of the invention 

10 

Thymidine is useful as a pharmaceutical intermediate, particularly for the chemical 
synthesis of azidothymldine ("AZT," sold under the trademark ZIDOVUDINE). 
Although ZIDOVUDINE-type AZT was one of the first therapies developed for 
HIV/AIDS, it continues to have Important and expanded use (Langreth, R., The Wall 

15 Street Journal, Nov. 21 , 1 995, pp B1 2). ZIDOVUDINE-type AZT is valuable particularly 
when used in combination therapies such as a combination with lamivudine (also 
known as 3TC), sold under the trademark EPIVIR. This lamvudine and 3TC 
combination is sold under the trademark COMBIVIR. Although the HIV virus can 
mutate to form resistance to either AZT or 3TC, COMBIVIR-type nucleotide-analog 

20 combination is particularly effective because the reverse transcriptase apparently 

cannot be resistant to both nucleoside analogues at the same time (Larder, B.A. et al.. 
Science 269: 696-699, 1995;. ZIDOVUDINE-type AZT is also useful in conjunction with 
HIV protease inhibitor type drugs (Waldholz, M., The Wall Street Journal, Jan, 30, 
1996, pp B1), and In the treatment of HIV infected pregnant women in order to reduce 

25 the frequency of infection of the fetus at birth. In 1 997 about 600,000 children died of 
AIDS contracted from their mothers at birth. ZIDOVUDINE-type AZT taken for several 
months prior to birth can reduce the transmission of the virus to infants by two-thirds. 
Thymidine produced by chemical synthesis used in the manufacture of AZT Is a very 
significant cost. 

30 

In U.S. 5,213,972 (McCandliss & Anderson, hereinafter "the '972 patent"), the entire 
contents of which are incorporated herein by reference and to which the reader is 
specifically referred, a process for the production of pyrimidine deoxyribonucleoside 
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(PdN) is disclosed (see in particular examples 7 to 14 of the '972 patent). A 
replicatable microorganism comprising and expressing a DNA sequence encoding a 
pyrimidine deoxyribonucleotide phosphohydrolase that converts a PdN monophosphate 
to a pyrimidine deoxyribonucleoside is taught. More particularly, McCandliss & 

5 Anderson, supra, describe a fermentation method that can be used to produce 
thymidine that involves the expression of deoxythymidylate phosphohydrolase 
(dTMPase) from the Bacillus bacteriophage PBS1 . This type of enzyme has been 
found in nature expressed by bacteriophages that do not contain thymidine in their 
DNA, but instead incorporates compounds like deoxyuridine or hydroxymethyi- 

10 deoxyuridine. 

In the thymidine fermentation described in the '972 patent, the enzymes that degrade 
thymidine (thymidine phosphorylase and uridine phosphorylase) have been removed by 
mutation so that thymidine accumulates. Thus, the use of the dTMPase enzyme helps 
create the pathway to allow thymidine synthesis. An expression of dTMPase alone, 
however, may not assure a commercially viable level of thymidine production. 
Accordingly, there is a continuing need to enhance the production of thymidine by cells 
expressing dTMPase in order to make thymidine production by fermentation 
commercially viable, by lowering the production cost relative to the current chemical 
synthesis methods. 

The biochemical pathway for pyrimidine deoxynucleotide production, for example, in E. 
coli Is highly regulated at the levels of transcription and translation as well as at the 
protein level by mechanisms including attenuation, feedback inhibition and enzyme 
25 activation. Neuhard, J. and R.A. Kelln, Biosynthesis and Conversion of Pyrimidines, 
Chapter 35 [In] Neidhardt, F.C. et al. [eds] "Escherichia coli and Salmonella Cellular 
and Molecular Biology", Second Edition, Vol. I, pp580-599. ASM Press, Washington 
D.C., 1996. The expression of dTMPase and elimination of thymidine breakdown by 
mutations in the deoA (thymidine phosphorylase), udp (uridine phosphorylase) and tdk 
30 (thymidine kinase) genes and therefore resulting expression products results in 
thymidine synthesis in E. coli but not at a commerically viable level. 



2 
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Summary of the Invention 

The biosynthesis of purines and pyrimidines involves a common step of reducing a 
ribonucleoslde diphosphate (in some species triphosphate) to its corresponding deoxy 

5 analog. In the overall process the reduction of the ribose moiety to 2-deoxyribose 
requires a pair of hydrogen atoms which are ultimately donated by NADPH and H*. 
However, the immediate electron donor is not NADPH but the reduced form of a heat 
stable protein called thioredoxin or glutaredoxin and at least one other unidentified 
source since the E. coli ribonucleotide reductase system still works in trx/K (thioredoxin) 

10 grx (glutaredoxin) double mutants (Neuhard and Kelln, supra). The reducing 

equivalents of the reduced thioredoxin are transferred to ribonucleoslde diphosphate 
reductase which carries out the reduction process. Manipulation of, for example, this 
step could prove useful in improving the commerical production of purine and 
pyrimidine deoxynucleosides. 

15 

It is an object of the present invention to provide novel DNA constructs e.g. vectors and 
genetically modified microorganisms comprising said vectors particularly for use in the 
production of recoverable amounts, especially commercially useful amounts, of 
pyrimidine and purine deoxynucleosides. 

20 

It is also an object of the present invention to provide processes which represent an 
improvement over McCandliss and Anderson described supra. 

In accordance with one aspect of the present invention there is provided a DNA 
25 construct comprising a transcriptional unit which comprises a ribonucleotide reductase 
gene and a thioredoxin gene or a uridine kinase gene and/or a dCTP deaminase gene. 

In one embodiment the DNA construct comprises a transcriptional unit which comprises 
a ribonucleotide reductase gene and a thioredoxin gene. 

30 



In another embodiment the DNA construct comprises a transcriptional unit which 
comprises a uridine kinase gene and/or a dCTP deaminase gene. 
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Preferably the DNA construct comprises a transcriptional unit which connprlses a 
uridine kinase gene and a dCTP deaminase gene. 

Most preferably the DNA construct comprises a transcriptional unit which comprises a 
5 ribonucleotide reductase gene and a thioredoxin gene and a uridine kinase gene and 
a dCTP deaminase gene. 

In accordance with another aspect of the present invention there is provided a modified 
host cell comprising a DNA construct according to the Invention. 

10 

In accordance with yet another aspect of the present Invention there is provided a 
culture medium comprising the modified host cells of the invention and processes for 
the production of a purine or pyrimidine, for example thymidine .comprising the use of 
said modified host cells . 

15 

In one embodiment the host cells comprise a DNA construct which construct comprises 
a transcription DNA unit (e.g. operon) which unit comprises DNA sequences encoding 
for ribonucleotide reductase arid thioredoxin in which said reductase preferably displays 
less sensitivity to allosteric inhibition than a wild type host cell equivalent or counterpart 
20 wherein said cell further comprises one or more of the following features: 

(a) a transcription unit (e.g. operon), preferably located on said DNA construct, 
comprising DNA sequences encoding for (and preferably heterologous with respect to 
host cell equivalent) thymidylate synthase; 

(b) a transcription unit (e.g. operon), preferably located on said DNA construct, 
25 comprising DNA sequences encoding for uridine kinase and preferably dCTP 

deaminase; and 

(c) repressed or absent Uracil DNA glycosylase activity. 

In another embodiment the DNA construct for use in the production of recoverable 
30 amounts of pyrimidine and derivatives thereof, in particular pyrimidine 

deoxyrlbonucleosides such as thymidine, comprises a transcription unit (e.g. operon) 
which unit comprises ( preferably heterologous) DNA sequences encoding for uridine 
kinase and/or dCTP deaminase. 
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Genetically modified host cells comprising and expressing the construct and culture 
medium comprising the modified host cells are also provided.., 

5 This aspect is based, in part, on the observation that host cells comprising DNA 

encoding for uridine kinase and/or dCTP deaniinase, Optionally togather with additional 
genes as suggested in US 5,213,972 required for thymidine production, lead to a 
significant improvement in thymidine production. 

10 The respective aspects of the present invention disclose for the first time a plurality of 
advances on the teaching of U.S 5,213,972 to provide improved DNA constructs and 
host cells comprising the constructs for use in the commercial production of pyrimidine 
deoxyribonucleosides, particularly thymidine. 

15 Other objects, features and advantages of the present invention will become apparent 
from the following description. It should be understood, however, that these represent 
preferred embodiments of the invention and are by way of illustration only. Various 
modifications and changes within the spirit and scope of the invention will become 
apparent to those skilled in the art. 

20 

Preferred embodiments of the invention 

The construct of the present Invention may be chromosomal or more preferably extra- 
chromosomal e.g. located on a vector. 

25 

Vectors of the present invention include plasmid, virus, transposons, minichromosome 
or phage, preferably plasmid. The vector comprising the transcription unit may be 
introduced into the host cell according to any convenient method known to those skilled 
in the art, e.g. PI transduction, electroporation or transformation. Suitable host cells 
30 useful In the present invention include eukaryotes and prokaryotes (e.g. Bacterium). 
Prokaryotes include E. coli, Salmonella, Pseudomonas, Bacillus, strains and mutants 
thereof. E.coli is preferred due to the large amount of information, genetic tools and 
mutant alleles that are available. It is particularly preferred that a method of 

5 
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transduction is available for the host cell of choice to enable mutations to be readily 
moved from one host cell to another and facilitate genetic mutation of the host without 
requiring direct mutation whenever a new mutation is desired. 

The present inventors have found that the use of bacteriophage T4 nrd/K, nrdB and 
nrdO genes are particularly useful for encoding the reductase and thibredoxin in E. coli: 
See Sjoberg, B. M. ef a/., EMBOJ., 5:2031-2036 (1986); Tseng, M.-J., etaL, J. Biol. 
Chem. 263:16242-16251 (1988); and LeMaster, DM., J. Virol. 59:759-760 (1986). 
More specifically, a very significant improvement in E. coli thymidine production was 
achieved through the cloning and expression of the T4 bacteriophage nrdA and nrdB 
genes coding for ribonucleotide reductase together with T4 nrdC coding for thioredoxin 
since the T4 ribonucleotide reductase cannot use E. co// thioredoxin. The T4-coded 
ribonucleotide reductase was found to be relatively insensitive to control by allosteric 
inhibition in vitro compared to the E. coli enzyme (Berglund, O., J. Biol. Chem. 
247:7276-7281 , 1 972). For example, unlike the E. coli enzyme (Berglund, O., J. Biol. 
Chem. 247: 270-7275, 1972) the T4 ribonucleotide reductase is not inhibited by dATP, 
but actually stimulated by dATP and ATP (Berglund, O.. J. Biol. Chem. 247:7276-7281 , 
1972). 

DNA sequences encoding for the ribonucleotide reductase ( e.g. nrdA and nrdB genes) 
and thioredoxin ( e.g. nrdC gene) are preferably heterologous with respect to host cell 
DNA and preferably derived from T phage (preferably E. coin bacteriophage), 
particularly T "even" phages e.g. T2, T4 or T6. See Campbell, A.M., Bacteriophages, 
Chapter 123, In Neidherdt, supra; and Mathews, C.K. et al. (eds.) Bacteriaophage T4, 
American Society of Microbiology, Washington, D.C., 1983. The term "derived from" is 
intended to define not only a source in the sense of its physical origin but also to define 
material which has structural and/or functional characteristics which correspond to 
material originating from the reference source. 

Another useful feature of the T even phage enzyme is its substrate specificity. The 
normal E. coli ribonucleotide reductase uses UDP as a substrate only poorly since the 
K„ for UDP Is about 10 fold higher for UDP than CDP (Neuhard and Kelln, supra). 
However, the T4 enzyme has only a two-fold difference in (Berglund, O., J. Biol. 
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Chem. 247: 7276-7281, 1972) between CDP and UDP substrates allowing two routes 
to dUTP synthesis. Although there have been attempts to obtain functional expression 
of T4 ribonucleotide reductase in E. coli, previous efforts were only successful in 
expressing the components separately and could demonstrate activity only by mixing in 

5 vitro (Tseng, M.-J., P. He, J.M. Hilfinger, and G.R. Greenberg, J. Bacteriol. 172: 6323- 
6332, 1990). Whilst not being bound by theory, the inventors believe that perhaps due 
to the lack of the usual pattern of feedback inhibition, expression of T4 ribonucleotide 
reductase in E. coli is lethal and it must be carefully conditionally expressed. Further 
envisaged are genes that encode precursor forms of the reductase and/or thioredoxin 

10 which are processed to produce a mature form. Such processing may proceed via 
various intermediate forms. 

Vectors of the present invention preferably comprise a regulatory element ( e.g. 
promoter such as lambda Pl, operator, activator, repressor such as lambda repressor, 

15 particularly a temperature sensitive variant, and/or enhancer), appropriate termination 
sequences initiation sequences and ribosome binding sites. The vector may further 
comprise a selectable marker. Alternatively, regulatory elehients (particularly lambda 
repressor) may be located on the host cell chromosome. It is preferred that nrdA and 
nrdB are arranged in the vector downstream (in terms of reading frame) from nrdC. In 

20 particular, it is preferred that nrdB is arranged downstream from nrdA. Thus a most 
preferred arrangement is a vector comprising an operon comprising nrdCAB. 

The T4 ribonucleotide reductase is not devoid of feedback-control in vivo (J. Ji, R.G. 
Sargent, and C.K. Mathews, J.Biol.Chem. 266: 16289-16292, 1991 ; and Berglund 

25 supra). To promote ribonucleoside diphosphate reduction further e.g. for thymidine 
production, the gene coding for the regulatory subunit, nrdA, may be modified by, for 
example, a mutational approach to create an enzyme capable of Increased thymidine 
production due to e.g. a reduced sensitivity to allosteric inhibition for example inhibition 
by the enzyme's immediate product or inhibition by a product resulting from a 

30 downstream event. 

In order to construct T4 nrdA mutants, site-directed mutagenesis may be used to 
modify or change (e.g. substitute) gene bases encoding amino acids suspected to alter 

7 
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e.g. dTTP binding site involved in allosteric regulation. Analysis of the amino acid 
sequence of T4 ribonucleotide reductase revealed a segment that appears to fit well 
with a postulated consensus sequence thought to be involved in dTTP binding (E.M. 
Mcintosh and R.H. Haynes, Mol. Cell. Biol. 6:1711 -1 721 , 1 986). Several changes may 

5 be made in this region of the T4 ribonucleotide reductase using oligonucleotide-directed 
mutagenesis. The general approach "may be modelled after the effort of More et al. 
(More, J.T., J. M. Ciesia, L.-M. Changchien, G.F. Maley and F. Maley, Biochemistry 33: 
2104-21 12, 1994) to reduce the dTTP binding of deoxycytidylate deaminase. One 
mutation, ''^Ala to lie, in the T4 nrdA appeared to be very useful. For example, the 

10 thymidine productivity of strains containing the '^^Ala to lie mutant in T4 nrdA evaluated 
by a shake flask fermentation method was significantly increased. As demonstrated 
below, the present inventors achieved at least 25% increase over the parent strain 
without this single change. 

15 Although the ^®Ala to lie is one successful example, those skilled in the art will now 
realize that many other amino acid changes to this region are now possible to obtain 
the desired effect, that being to putatively disrupt dTTP binding, but not disrupt the 
enzyme's basic functionality. For example, substitution of ^®Ala with other amino acids 
displaying similar side chains to lie (e.g. leucine, valine) may be utilized. Modifications 

20 of position 79 in conjunction with other modifications (e.g. mutations) within the 

postulated consensus region are also envisaged. Deletion of one or more amino acid 
positions in the consensus region and introduction of synthetic DNA into the region are 
other approaches available to those skilled in the art. 

25 In another aspect of the present invention there is provided a host cell comprising a 

construct which construct (e.g. vector) comprises a transcriptional unit comprising DNA 
sequences encoding for heterologous ribonucleotide reductase and thioredoxin which 
reductase is less sensitive to allosteric inhibition than the wild type host cell equivalent 
or counterpart. It will be apparent to those skilled In the art that determining the relative 

30 sensitivity of a candidate heterologous reductase to allosteric inhibition compared to 
the wild type host cell equivalent is a matter of routine experimentation and observation. 
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Transcription units comprising the DNA sequences e.g. nrdA, nrdB and nrdC genes are 
preferably operons wherein the nrd genes are arranged in tandem. This permits 
transcription of these genes as a single mRNA transcript. In order to minimize 
unproductive energy expenditure by the host cell and further to minimize plasmid size, it 
5 is preferred that the operon contains only genetic sequences required in the encoding 
of reductase and thioredoxih (Including any regulatory or control elements). This may 
necessitate the removal of superfluous DNA (for example, the unusual intron in the 
phage T4 nrdB gene, Sjoberg, B-M., et al EMBO J.5: 2031 -2036, 1986). 

10 In other preferred embodiments, vectors of the present invention for use in for example 
the production of thymidine further comprise DNA sequences encoding for thymidylate 
synthase ( e.g. the td gene). See e.g. Chu, F. K. etal., Proc. Natl. Acad. Sci. USA 
81:3049-3053 (1984); Chu, F. K. et ai. J. Bacteriol. 169:4368-4375 (1987). The 
purpose of using this enzyme is to improve control over the levels of deoxyuridine 

15 produced and in particular the relative impurity level of deoxyuridine relative to 

thymidine. The dTMPase enzyme is not completely specific for dTMP. With a higher 
K^than for dTMP, the PBS1 dTMPase will also utilize dUMP as substrate to produce 
deoxyuridine (Price, A. R., Methods in Enzymol. 51 : 285-290, 1 978). Deoxyuridine 
creates a significant problem for thymidine purification. Therefore, one way to reduce 

20 deoxyuridine production is to efficiently convert dUMP to dTMP by increasing the level 
or effectiveness of thymidylate synthase such that the internal concentration of dUMP 
always remains very low. 

The thymidylate synthase gene (td) may be heterologous with respect to the host cell 
25 and it is preferred that td is derived from (in the sense defined supra) T bacteriophage, 
e.g. T "even" phage and in particular T4 phage td. Although td may be located in its 
own transcription unit. It is preferred that td is located In the same transcription unit e.g. 
operon as nrd genes. Moreover, it is preferred that td is located in the same operon 
downstream (in terms of reading frame) from the nrd genes. 

30 

McCandliss and Anderson, supra, amplified the E. coli thymidylate synthase gene in 
plasmids pCG138 and pCG148 (see Table 5, of the '972 patent) and It was found to be 
partially effective in reducing deoxyuridine. The T4 thymidylate synthase is much more 

9 
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effective which is surprising in light of the fact that the E .coli enzyme is not thought to 
be controlled by any type of allosteric regulation (Neuhard and Kelln, supra). The E. 
coli enzyme for dUMP, 4 iiM (Wahba, A. J. and M. Friedkin, J. Biol Chem. 237: 
3794-3801 ). and the T4 enzyme for dUM.P. 2.73 \xM (Maley, F., L. LaPat-Polasko, 
5 V. Frasca and G.F. Maley, Functional domains in T4 Thymidylate Synthase as probed 
by site-directed mutagenesis, Chapter 29 [In] Karam, J.D. [ed] "Molecular Biology of 
Bacteriophage T4", American society for Microbiology, Washington, DC, 1994, pp 322- 
325), are similar and cannot explain the large difference in effectiveness. Whilst not 
being bound by theory, the inventors believe that the E. coli thyA has an internal 
10 transcription termination sequence derived from an upstream gene that could be 
effecting the expression level in plasmid clones (Bell-Penderson, D, J.L. Galloway - 
Salvo, and M. Belfort, J. Bacterial. 173: 1193-1200, 1991). 

In other preferred embodiments, host cells of the present invention, particularly for use 
15 in the commercial production of pyrimidine deoxyribonucleosides e.g. thymidine 

comprise a transcription unit (e.g. operon) which unit comprises DNA sequences e.g. 
udk gene encoding for uridine kinase and preferably DNA sequences e.g. dcd gene 
encoding for dCTP deaminase. See e.g. Wang, L. and B. Weiss, J. Bacteriol. 
174:5647-5653 (1992); and Neuhard, J. and L. Tarpo, J. Bacteriol. 175: 5742-5743. 

20 

The construct of this aspect of the invention may additionally comprise a transcription 
unit encoding for ribonucleotide reductase {nrdA and nrdB) and the thioredoxin (nrdC), 
or precursor forms thereof which are preferably heterologous with respect to host cell 
DNA and preferably derived from E.coli bacteriophage, particularly T "even" phages e.g. 
25 T2,T4 or T6. 

Uridine kinase produces UMP and CMP from uridine and cytidine using GTP (or dGTP) 
as the phosphate donor. The reaction Is inhibited by UTP and CTP (J. Neuhard and 
R.D. Kelln, Biosynthesis and Conversions of Pyrimidines, Chapter 35 [in] F.C. Neidhart 
30 et ai. [eds], " Escherichia Coli and Salmonella Cellular and Molecular Biology", Second 
Edition, ASM press, Washington D.C). The present inventors have found that the use 
of uridine kinase particularly together with dCTP deaminase leads to a marked 
improvement in the production of thymidine by host cells incorporating these changes 
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together with the teachings of '972 outlined supra. This observation is quite 
unexpected since uridine kinase, on the basis of current information, has no direct role 
in pyrimidine de novo biosynthesis, moreover that its use would be beneficial in 
commercial processes for the production of pyrimidine deoxyribonucleosides. It is 

5 preferred that udk and dcd genes are arranged in tandem in the same operon. Further 
envisaged are genes that encode precursor forms of the udk and dcd gene which are 
processed to produce a mature form. Such processing may proceed via various 
intermediate forms. The udk and dcd genes may be introduced into the construct { e.g. 
vector) from any suitable source by methods well known to those skilled In the art for 

10 example PI transduction, electroporation or transformation. 



The enzyme uracil DNA glycosylase, encoded by the ung gene, is responsible for 
degrading DNA that has uracil incorporated in place of thymine. Where host cells of the 
present invention are used In the commercial production of e.g. thymidine, the internal 

15 cellular concentration of dTTP may be lowered as a result of the utilization of dTMP ( a 
precursor of dTTP) in the production of thymidine. Accordingly, the present inventors 
have recognized that there is potentially a greater propensity for uracil incorporation 
into the host DNA which may be lethal to a wild type host due to the uracil DNA 
glycosylase activity causing too many single stranded breaks in the host cell DNA. 

20 Thus, host cells useful in the present invention may further display repressed 
(compared to the unmodified cell) or no uracil DNA glycosylase activity. This 
repression or absence may be achieved through various ways apparent to those 
skilled in the art. For example, antagonism ( either total or partial) of the ung gene 
expression products is one such approach by introducing an antagonist of the 

25 functional enzyme (or precursor thereof) into the host cell. Other approaches include 
manipulating ung gene expression by e.g. modifying regulatory elements of ung gene 
expression or introducing mutations into the ung gene itself such that ung gene product 
expression displays little or no uracil DNA glycosylase protein and/or activity. Another 
approach is to delete the ung gene ( or functionally critical parts thereof) from host cell 

30 DNA. The absence or low level of uracil DNA glycosylase activity may be a feature of 
the host cell without the need for further manipulation. 



11 
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In preferred embodiments of the present Invention, each of the advances taught herein 
are incorporated into a host cell. The nrd, td, udk and dcd genes may be located on 
separate constructs but it is preferred that they are all located on the same construct 
e.g. vector. Thus in a particularly preferred embodiment of the present invention, a 

5 modified host cell is provided in which the cell comprises a DNA construct (e.g. vector) 
comprising a transcription DNA unit (e.g. operon) which Unit comprises DNA sequences 
encoding for (preferably a T even phage, e.g. T4) a modified ribonucleotide reductase 
and thioredoxin in which said reductase preferably displays less sensitivity to allosteric 
inhibition than wild type host cell equivalent or counterpart wherein said construct 

10 further comprises: 

(a) a transcription unit (e.g. operon) encoding for (preferably heterologous with respect 
to host cell equivalent or counterpart) thymidylate synthase and; 

(b) a transcription unit (e.g. operon), encoding for uridine kinase and preferably dCTP 

15 deaminase; 

and in which the host cell displays repressed or absent uracil DNA glycosylase 
activity. 

Host cells modified according to the present invention are particularly useful in the 
20 commercial production of pyrimldine deoxynucleosides. In a particularly advantageous 
use of the present invention, E. coli host cells comprising (harboring) a plasmid 
modified according to the present invention (particularly in conjunction with the 
teachings of the '972 patent) may be used in the commercial production of thymidine. 
Thus, host cells modified according to the present invention may further comprise 
25 dTMPase derived from e.g. PBS1 and the mutations taught In the '972 patent, e.g. 
deoA, tdk-1 and udp-1. 

Generally, a fermentation method is employed which involves submerging the cells in a 
culture medium contained within a suitable vessel. Following culturing under 
30 appropriate conditions, produced thymidine is harvested and purified (enriched), if 
necessary, to pharmaceutical grade according to standard protocols. The purified 
thymidine may then be used in the production of medicaments, e.g. pharmaceutical 
compositions such as AZT. 

12 
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Brief description of the drawings 

The present invention is illustrated by way of example only and with reference to the 
5 following figures in which: 

Fig.1 illustrates, schematically, a route for the construction of pCG366. It should be 
noted that the plasmids are not drawn to scale. 

10 Fig.2 illustrates, schematically, a route for the construction of pCG374 and pCG375. 

Fig.3 illustrates a map for plasmid pCG532. 

Fig.4 illustrates growth and thymidine production by recombinant E. coli strain 
15 CMG2451 (Ungl and CMG2492(Ung ) hosting a plasmid pCG366 (nrdCAB td) 
according to example 1 0. 

Fig.5 illustrates thymidine production in a 30-liter fermentor by E. coli CMG2451 
(pCG532). 

20 

Fig.6 illustrates TdR and UdR (mg/L) obtained according to the purification protocol of 
example 12. 

Example 1 

25 Cloning of T4 nrdCAB genes and demonstration of activity 

The bacteriophage T4 nrd/KB genes were cloned by performing the polymerase chain 
reaction (PGR) with isolated phage T4 DNA. The primers for PGR nrdfK gene were: 
5'-TAT TOT AGA CGA TTT TCA AGT TGA GGA CTT ATG C-3' (Seq. id 1 ); 
30 and 5'-TAT ATG GAT AAT TCA TTA CAA TTT ACA CGC TGC AC-3' (Seq. id 2). 



13 
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The restriction site Xba\ was the introduced at the beginning of the nrdA in the amplified 
DNA, and C/al was introduced at the 3' end of nrdA. The primers for PGR amplification 
of nrdB gene were: 

5'-TAT ATC GAT AAA TGT AAA TTT AAG GAT TCT AAA TG-3' (Seq. Id 3) and 
5 S'-TAT GTC GAG TGG TTA AAA GTA TTT TTT AAA AGT G-3' (Seq. id 4) . 

The restriction site C/al was thus introduced at the beginning of the nrdB, and Apa\ was 
introduced at the end of nrdB in the amplified DNA. The PGR fragments were cloned 
into plasmid vectors as illustrated in Fig. 1 according to techniques known to those 

10 skilled in the art. The cloned nrdAB genes were confirmed by enzyme activity assay. 
The T4 nrdC gene was cloned into pKC30 producing plasmid pDL51 (LeMaster, D.M., 
J. Virology 59: 759-760, 1986 ) and was supplied by D. LeMaster (Dept of Biochem., 
Univ. Wisconsin, Madison, Wl). The gene was sub-cloned into a plasmid with nrdAB 
genes as illustrated in the Figure 1 . The sources of starting materials and background 

15 information used in Figure 1 are listed in Table 1 . 

A synthetic transcriptional terminator was used for the construction of pCG198 and 
pCG301 (see Fig. 1 and Table 1). Specifically, plasmid pBG sk* obtained from 
Stratagene (La Jolla, California) was digested with restriction enzyme Apa I and 
20 Asp7A 8 1. Synthetic DNA containing the ECOTGP transcription termination sequence 
(d' Aubenton Garafa et. ai, J. Mol. Biol. 216: 839-843, 1990) was then ligated replacing 
the original sequence 



14 
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Table 1: Genealogy of Plasmid pCG366 and pCG532 and Materials Source 



Plasmid 


Gene(s) 


Pro- 
moter 


Vector 
origin 


Marker 


Derivation 


PACYC177 


amp kan 




p15A 


amp kan 


Chang and Cohen (1978) J. 
Bacteriol. 1 34:1 141-11 56. ATCC 
37031 


pBC sk+ 


Cam 


P/ac 


ColE^ 


Cam 


Stratagene.l 101 1 North Torrey 
Pines Road, La Jolla, CA 92037. 


pBluescript . 
Uks 


Amp 


P/ac- 


co/EI 


Amp 


Stratagene.l 101 1 North Torrey 
Pines Road, La Jolla, CA 92037. 
ATCC 87047 


pBR322 


amp tet 




colE^ 


amp tet 


Bolivar ef a/. (1977) Gene 2:95-113. 


pDL51 


T4 nrdO 




colE^ 


Amp 


LeMaster, J. Virol. 59: 759-760, 
1986 


pKC30 


Amp 




colE1 


Amp 


Shimatake, and Rosenberg. (1981) 
Nature 292:128. ATCC 37286 


PKTdAl 


lAtd 




colE-i 


Amp 


Westetal. (1986) J.aC. 
261:13446-13450. 


pT7/T3/18 


Amp 


phage 
T7/T3 


colE^ 


Amp 


Life Technologies, INC.,9800 
Medical Center Dr., Rockville, MD 
20897. 


pTZI 8U 


Amp 




colE't 


Amp 


Dio-riaa, ^uuu MiTrec iNOoei uf-, 
Hercules, CA 94547. 


pucia 


Amp 


Plac 


colEA 


Amp 


Yanisch-Perron et al. (1 985) Gene 
33:103-119. ATCC 37253 


pUC19 


"Amp 


Plac 


colE^ 


Amp 


Yanisch-Perron et al. (1985) Gene 
33:103-119. ATCC 37254 


pCG173 


>.Pl vector 


?.Pl 


colE-i 


Amp 


XP\_ in pBluescript 11 ks 


pCG196 


T4 nrdC 




CO/E1 


Amp 


T4nrdC in pUC19 


pCG198 


Cam 


P/ac 


colE-i 


Cam 


Synthetic transcription terminator in 

pBC sk+ 


pCG301 


X.Pl vector 


A.Pl 


co/El 


Cam 


XP\_ in pCG198 


pCG308 


T4 nrdC 


XPi 


CO/E1 


Cam 


T4 nrdC in pCG301 , LeMaster, J, 
Virol. 59: 759-760, 1986 


pCG312 


14 nrdA 


PT7 


CO/E1 


Amp 


T4 nrdA in pT7/T3/18 


pCG318 


NrdB 


PT7 


co/EI 




T4 rtrdB in pBluscript II ks 


pCG323 


NrdAC 




co/EI 


Cam 


T4 nrdAC in pCG301 




A/rdABC 




co/EI 


"^m 


74 rjrdABC in pCGSOi 




NrdAB 


Tp" 


co/EI 




T4 nrdAB in pCG301 


pCG334 


NrdAB 


XPl 


co/EI 


Cam 


T4 nrdAB in pCG301 


pCG337 


NrdCAB 


XPl 


co/EI 


Cam 


74 nrdCAB in pCG301 


pCG343 


NrdCAB 


XP^ 


co/EI 


Cam 


T4 nrdCAB in pCG301 


pCG356 


Td 


XPi_ 






T4 td in pCG301 


pCG358 


dcd, udk 




CO/E1 

(pUCl8 

) 


Amp 


E. colidcd. udk in pUCl8 


pGG360 


NrdCAB, td 


XPi^ 


co/EI 


Cam 


T4 nrdCAB td in pCG301 


pCG366 


NrdCAB, td 


XPi_ 


co/EI 

(pBR32 

2) 


Cam Tet 


1 nroLrAD, TO in pcsriOi^^, Mmp^ 


pCG374 


dcd, udk 




p15A 


amp 


E. colidcd, udk in pACYC177 


pCG376 


IPl vector 


XPt_ 


p15A 


amp 


pACYC177 with XP\_ & MCS of 
pCG301 


pCG464 


T4 nrdA 

Xbal/HindlU 

fragment 




(pTZlS 
U) 

colEi 


Amp 


Xbal/Hindlll fragment with nrdA 
sequence cloned into A7>fll/i/i«dlll sites 
of pTZlSU (BioRad Laboratories) 
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pCG492 


T4nrdAXbal/ 

Hindlll 

Fragment 




(pTZ18 
U) 

co/El 


Amp 


PCG464 with mutation in T4 wrdA 
sequence causing ^^ala ile alteration 


pCG494 


NrdCAB id 




co/El 

(pBR32 

2) 


Cam 
Tet 


KpnMAflW fragment from pCG494 
inserted into pCG366 adding 79ala -> 
ile mutation to nrdA 


pCG532 


NrdCAB, td, 
udk. Dcd 




CO/E1 

{pBR32 

2) 


Cam 


udk.dcd cloned into pCG494 



between /Vpa I and Asp7^ 8 I endonuclease sites. This fragment recreated the Apa I 
recognition sequence, but destroyed the Asp 71 8 I recognition sequence in the new 
plasmid . The inserted DNA had the following composition 
5 5' -CGAGC CCGCCTAATG AGCGGGCTTT TTTTT -3' 

3'-CCGGGCTCG GGCGGATTAC TCGCGCGAAA AAAAACATG - 5' 
(shown as respective strands in seq. id 5 and 6) 
produced from two oligonucleotides. 

10 Plasmid pCG1 98 was combined with pCG1 73 that contains the lambda Pl promoter 
from plasmid pKC30 cloned into pBluescript II ks (Stratagene, La Jolla, California) as 
shown in Figure 1 . Both pCB198 and pCG173 were digested with H/nd III and Asn I 
then ligated to create new plasmid pCG301 containing the lambda Pl promoter, multiple 
restriction enzyme cloning sites, and followed by the ECOTGP terminator sequence 

15 copied from the tryptophan operon leader peptide region. 

T4 ribonucleotide reductase activity was measured by HPLC by a method that does not 
involve the use of radioisotopes and UDP substrate. The direct ribonucleotide 
reductase assay contained 1 mM NADPH, 1 n\M DTT, 0.5 m/W dATP, 0.6 mM UDP, 20 

20 mM Tris (pH 8.0) and 5 mM MgClg. Under these conditions the E. coli ribonucleotide 
reductase is inhibited and is not detected. The enzyme reaction (100 ni.) was stopped 
by the addition of 10 of 50% trichloroacetic acid (TCA). After 10 minutes on ice, the 
samples are centrifuged in a microcentrifuge. The supernatant was extracted 4 times 
with diethylether to remove the TCA. Five ml of Tris buffer (1 .0 M pH 8.0) was added 

25 followed by 2 of 40 mg/mi. rattle snake venom (Sigma), and the sample was 

incubated at 37°C for 60 minutes. The samples were then heated for 3 minutes at JCC 
followed by centrifugation for 5 minutes to remove precipitate. The volumes are 
equalized, then analysed by HPLC with an UV detector and deoxyuridine as the 



wo 01/02580 PCT/GBOO/02357 

Standard. The column is a Spherisorb ODS-2, 5 micron, 250 mm X 4.6 mm using a 12 
mM ammonium phosphate (pH 5.0) mobile phase and a flow rate of approximately 1 .0 
mL/minute. Results are shown in Table 2 for cells containing plasmid pCG343 
demonstrating functional expression of T4 ribonucleotide reductase. 

5 

Table 2: Ribdriucleotide Reductase Activity 



Strain 


Induction condition 


Specific Acitivity 
nmol/10 min./mg protein 


CMG1093 


Uninduced 


0 


CMG1093 


Induced 


0 


CMG1093/pCG343 


Uninduced 


0 


CMG1093/pCG343 


Induced 


704.7 



EXAMPLE 2 

10 Derivation of Host Strain CMG2451 from Strain CMG1115 

Strain CMG1 115 is fully described in McCandliss & Anderson (U. S. Patent 5,213, 972). 
CMG1 1 1 5 was the starting point for development described herein. Strain CMG1 115 
was improved for thymidine productivity by selection for growth on medium containing 

15 30 mg/L of 5-f!uorouridine that yielded strain CMG2401 . Strain GMG2401 was then 
selected for growth on medium containing 30 mg/l of 3'-azido-3'-deoxythymidine which 
yielded strain CMG2404. CMG2404 requires L-proline for growth due to the inherited 
mutation A{Jac-pro) from its original parent JM101 . Hfr mating between GMG2404 and a 
Hfr stain CAG5053 { Singer, M. et al. Microbiological Review: 5:1-24,1989) was 

20 performed according to techniques known to those skilled in the art and yielded strain 
CMG2434 which is Lac*, Pro*. The udp (uridine phosphorylase) mutation in CMG2434 
still had partial uridine phosphorylase activity that was evident based on thymine 
accumulation after induction of thymidine production. The udp mutation was 
reintroduced from CGSC5128 (E. coli Genetic Stock Center, Yale University) by phage 

25 PI transduction according to techniques known to those skilled in the art. The 

mefE3079::TntO from strain CAG18491 was first transduced into CMG2434 to serve as 
a positive selection marker for transduction of udp. Then the udp-1 was transduced 
into the /77efE3079::Tn70 derivative of CMG2434 by selecting for growth without L- 
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methionine in the defined medium. The udp-^ derivative was named strain CMG2451 . 
The genealogy of CMG2451 is summarized in Table 3. 

Table 3: Genealogy of E. coli Host Strain CMG2451 

5 



strain 


^ , ^Genotype , ^ 4- 


^ i - T Derlviatibn "-^ ^ ; > 4 


CMG1115 


CMG1106 (Tn5::dTMPase kan") 


Tn5::dTMPase insertion from pCG132. 


CMG2401 


CMG1115 FUdR" 


5-fluoro-2'deoxyuridine resistance. 


CMG2404 


CMG2401 AZT" 


3'-Azido-3'deoxythymidine resistance. 


CMG2434 


CMG2404 Lac^ Pro* 


Repair A(/ac-proAB) by conjugation with 
CAGSOSa''. 


CMG2448 


CMG2434 me/E3079::TnrO 


metE3079::Tn10 from CAG 18491". 


CMG2451 


CMG2448 udp-^ metE* tef 


urfp-l from CGSC5128 " and replaced 
mefE3079::TnrO. 



Mutations were at times introduced into E. coli strains by phage PI transduction. If the mutation 
has a selective marker, direct PI transduction was used. If the mutation has no selective marl<er, PI 
cotransduction with the nearby TnIO insertion was used. 
10 *• Singer, M., et al. Microbiological Review: 53:1 -24, 1989. 

»• All CGSC strains can be obtained for E. coli Genetic Stock Center. Yale University. P.O. Box 
2081 04. New Haven , CT 06520-81 04. 

EXAMPLE 3 

15 Cloning and expression of T4 td gene Into thymidine production plasmid 

The T4 td gene was cloned into pKTdAI by West et al. ( J. Biol. Chem 261 :1 3446- 
13450, 1986) without the1017-base pair intron. The td gene was sub-cloned into 
pCG301 using two oligonucleotides as linkers with the following sequences: 
20 5'-GAT CCG GAG GAT AAA TGA AAC AAT ACC AAG ATT TAA T-3' (seq. id 7) and; 
5'.TAA ATC TTG GTA TTG TIT CAT TTA TOG TOG G-3' (seq.id 8). 

The result plasmid was pCG356. The td gene from pCG356 was then sub-cloned into 
pCG343 to create pCG360 (Fig. 1). The tetracycline resistance gene and plasmid 
25 replication origin from pBR322 (Bolivar, F. et al., Gene 2: 95-1 1 3, 1977) was sub- 
cloned into pGG360 and formed pCG366. The thymidylate synthase activity was 
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measured by spectrophotometric method of Wahba and Friedkin (J. Biol.Chem. 236: 
PC1 1 -PC1 2, 1 961 ). The results are shown in Table 4. 

Table 4: Thvmidvlate Synthase Activity 

5 



Strain 


Specific Activity 
AOD340/mg protein/min 


C1VIG1093 


-0.72 


CMG1093/ pKTdDI 


3.24 


CMG1093/pCG356 


3.45 


CMG1093/pCG360 


1.43 



EXAMPLE 4 

Shake flask data showing the value of T4 td gene on thymidine production and 
deoxyuridine reduction 

10 

The shake flask fermentation was used to evaluate the thymidine productivity of 
different E. coli recombinants. The shake flask fermentation broth and methods used 
here, and in other examples, is described in Example 6 below. In this case 20 mL 
volume per flask was inoculated with 2 ml of seed culture and incubated in a 30°C 

15 shaker. At about 1 0 OD 600nm, the flasks were transferred into a 37^C shaker for 30 
minutes to mildly induce the X P^_ promoter. Then the flasks were transferred into a 35°C 
shaker to continue the fermentation. Glucose was fed during fermentation as needed, 
and pH was adjusted to about 7 with ammonia according to the color of phenol red. The 
thymidine concentration was measured by HPLC using a Spherlsorb ODS-2, 5 micron, 

20 250-mm X 4.6 mm column, and a 25 mM ammonium phosphate (pH 3.3) mobile phase 
with a flow rate of approximately 1 .5 ml/minute. 

The strain CMG2451 /pCG366 (T4 nrdCAB, td) was compared with CMG2451 /pCG343 
(T4 nrdCAB) in shake flask fermentation. Table 5 shows that deoxyuridine 
25 concentration was reduced, and converted to thymidine in strain CMG2451/pCG366 
due to the T4 td gene. 
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Table 5: Thymidine Production-Effect of Thvmidvlate Synthase at 66 hours . 



Stain 


Thymidine (mg/l) 


Deoxyuridine (mg/l) 


% Deoxyuridine 


CMG2451/pCG343 


1198 


1728 


144.2 


CMG2451/pCG366 


3327 


206 


6.2 



EXAMPLE 5 

5 

Construction of the ^^Ala to lie T4 ribonucleotide reductase mutant 

The mutation was introduced by using site-directed mutagenesis based on a method 
described by Kunkel (Kunkel, T.A., Proc. Natl. Acad. Sci. ,USA, 82: 488-492,1985). All 

10 materials for mutant construction including pTZ18U phagemid DNA, M13K07 helper 
phage, bacterial strains E. coli CJ236 and MV1 1 90, T7 DNA polymerase and T4 DNA 
ligase were provided in the Muta-Gene in vitro mutagenesis i^it from Bio-Rad 
Laboratories (Hercules, CA). At first, the X£)al/H//?dlll DNA fragment containing the nrdA 
gene of T4 bacteriophage was isolated from plasmid pCG312 (ChemGen Corp., Table 

15 1 above). The XbaUHindWl DNA fragment was cloned into the Xba\/IHind\\\ sites of the 
pTZ18U phagemid vector using standard protocols (Sambrook, J., Fritsch, E.F. and 
Maniatis, T., Molecular Cloning, Cold Spring Harbor Laboratory, New York ,1989). The 
insert-carrying phagemid pCG464 was introduced into E. coli CJ236. This strain is 
deficient for dUTPase {dut) and uracil-N-glycosylase {ung) which results in an 

20 occasional substitution of uracil for thymine in newly synthesized DNA. Single stranded 
DNA of pCG464 containing uracil was isolated from CJ236 according to Bio-Rad 
Laboratories Instructional Manual. This DNA (0.2 pMole) was annealed with 6 pMole of 
phosphorylated primer 

5 -AGC AAA CAT TAA ACA GCG TGC AATTAC ATA TTG ATA ATC AGG TTC-3' 
25 (sequence id 9) containing the sequence of the desired mutation (underlined) coding for 
He instead of original ^^Ala. 

Complementary strand DNA was synthesized by using T7 DNA polymerase as 
described in the Bio-Rad protocol. The reaction products were transformed into E. coli 
30 MV1 190 containing a wild type uracil-N-glycosylase, which degrades the uracil- 

containing parental strand, thus enriching for the mutant strand. Direct DNA sequencing 

20 
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using the Silver Sequence DNA Sequencing System from Promega Corp. (Madison, 
Wl) identified plasmids containing the desired mutation. It appeared that all four 
analysed transformants contained plasmids with the mutation. One of them was 
designated pCG492 and used for further experiments. To check if the mutation affects 

5 thymidine synthesis it was introduced into production plasmid pCG366 (ChemGen 
Corp., Table 1 and Figure 1). For this, the Kpn\/Afl\\ DNA fragment of pCG366 
containing 5'-part of the nrdA gene was replaced with Kpn\/Afl\\ fragment from pCG492 
that contains the mutation. The new plasmid pCG494 was introduced into production 
strain CMG2451 (ChemGen Corp., Example 2 and Table 4 above). The effect of T4 

10 nrdA mutation was evaluated by comparison of thymidine production by CMG2451 
(pCG494) and CMG2451 (pCG366) as shown in Example 6 below. 

EXAMPLE 6 

Shake flask fermentation for Thymidine production using CMG2451 (PCG366) and 
15 C1VIG2451 (PCG494) 

The 250-mL baffle flasks containing 25 mL of production medium were inoculated with 
2 mL of a freshly grown seed culture in LB broth with appropriate antibiotic added. The 
cultures were grown in a SO^C shaker at 250 rpm. When the ODgoo reached about 5, the 

20 flasks were transferred into a 37°C shaker for 30 min. Then the flasks were transferred 
into a 35°C shaker to continue the fermentation. The production medium has the 
following composition (g/L): Ardamine YEP-S (Red Star Yeast & Products, Milwaukee, 
Wl) - 10; CaCOg -10; MgS04 - 0.4; phenol red - 0.24; PP90BT (DMV International, 
Fraser, N.Y.) - 4.5; sorbitol - 20; chloramphenicol - 0.03; trace elements (1000X) - 

25 1 mUL. The trace elements (1 OOOX) formulation is the following (g/L): boric acid - 0.05; 
calcium chloride - 20; cobalt sulfate - 0.05; copper sulfate - 0.01 ; ferrous sulfate - 20; 
ferric chloride - 20; manganese sulfate - 0.5; sodium molybdenate - 0.1 ; and zinc 
sulfate - 0.1 . At the time of Induction 10 grams per liter of Ardamine YEP-S was added. 
Glucose was fed during the fermentation on an average of every two hours (2.5 g/L). 

30 The pH in the flasks was maintained at approximately 7.0 through the addition of 4N 
NH4OH as judged by the color of the phenol red indicator dye. The ODgoo was read after 
sample dilution 1 :10 into 10 mM H2SO4 to dissolve salts in the medium. Thymidine 
concentration was measured by reverse phase C-1 8 HPLC with an Alltech Spherlsorb 
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ODS-2 column and Shimadzu spectrophotometric detector at 260 nm. The mobile 
phase was a 25 mM NH4H2P04 (pH 3.3) in water at the constant rate of 1 .5 mL/mln. 



The results of thymidine production by CMG2451 (pCG366) and CMG2451 (pCG494) 
after 2, 1 7 and 25 hours after Induction are presented in Table 6. There were two 
repeats of the flasks m this experiment and variability between duplicate flasks did not 
exceed 15%. The results show that the T4 nrdA mutant performed better than the wild 
type nrdA strain. This was confirmed in several independent shake flask experiments 
with the same bacterial strains. 

Table 6: Thymidine production by CMG2451 (pCG366) 
and CMG2451 (pCG494) 

Strain 2 hours 1 7 hours 25 hours 

Specific activity (mg/L/OD) 
CMG2451 {pCG366) 6l 32^5 43l 

CMG2452 {pCG494) 8.5 36.1 51.4 



EXAMPLE 7 

Shake Flask Fermentation to Demonstrate the Effect of the E. coU udk Gene on 
Thymidine Production. 

The dcd gene or dcd udk operon were cloned into the pACYC177 vector. This vector 
with the pi 5a origin of replication is different from co/EI based plasmids such as 
pCG366 and thus is compatible and can be maintained in the same host with co/EI 
based plasmids. The details of the plasmid constructions resulting in pCG374 {udk 
dcd) or pCG375 {dcd) are shown in Figure 2. The genes on these plasmids are 
expressed from the native E. coli promoter of the udk dcd operon. 

Using selection for ampicillin resistance, plasmids pCG374 and pCG375 were 
Introduced into GMG2451 {pCG366) to test the effect on thymidine production in the 
shake flask fermentation method described in Example 6. The results at several time 
points are shown in Table 7. Although the specific activity (thymidine per OD of cells) i 
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similar both with and without the udk gene, the cells with the udk gene on the second 
plasmid pCG374 grew to a higher cell density and produced significantly more 
thymidine (5.8 g/L compared to 3.0 g/L for the strain with the dcd only second plasmid 
). This result was not anticipated, as it is not clear why uridine phosphorylase could 
have this effect. 

Based on this data and other information, the udk gene was chosen for introduction 
along with the E. coli dcd gene in the construction of plasmid pCG532 (see Example 8 
and Figure 3). 



Table 7: Comparison of the effect of second plasmids with dcd or dcd udk on 
thymidine production in the Ci\AG2451 (pCG366) baclcground. 



Time 
(hour) 


O.D.600 


Thym 
(m. 


udine 

g/0 


Specific Thymidine 
(mg/l/OD) 


Base 
Strain 
And 
Plasmid 


CMG 
241 
(pCG 
366) 


CMG 
2451 
(pCG 
366) 


CMG2451 
(pCG366) 


CMG2451 
(pCG366) 


CMG2451 
(pCG366) 


CMG2451 
(pCG366) 


Second 
pACYC 

177 
based 
plasmid 


PCG 
374 
with 
dcd 
udk 


pCG375 
with 
dcd 


pCG374 

with 
dcd udk 


pCG375 
with 
dcd 


pCG374 
with 

dcd udk 


PCG375 
with 

dcd 


8 hr 


29.112 


29.94 


889 


884 


30.5 


29.5 


18 hr 


40.206 


32.946 


1721 


1608 


42.8 


48.8 


42 hr 


48.348 


33.96 


4020 


2740 


83.1 


80.7 


66 hr 


57.498 


28.614 


5804 


3012 


100.9 


105.3 



EXAIViPLE 8 

Cloning of the E. coli udk dcd operon into production plasmid pCG494 . 



The udk and the dcd genes of E. coli encode pyrlmidine ribonucleotide kinase and 
dCTP deaminase, respectively. Both genes were mapped to a 3.4 kb BamH\IPst\ DNA 
fragment of lambda phage 355 of the Kohara genomic library (Kohara, Y., Akiyama, K. 
and Isono, K., Cell 50, 495-508, 1987). It appears that udk is located upstream of dcd 
and transcribed in the same direction as dcd (Neuhard, J. and Tarpo, L., J. Bacterial. 
175: 5742-5743, 1993). The genes were cloned into production plasmid by two steps. 
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At first, a 3.4 kb BamH\/Pst\ DNA fragment from lambda 355 was cloned into the 
multiple cloning site of plasmid pUC18 {Yamisch-Perron, C, Vieira, J. and Messing, J., 
Gene 33: 103-109,1985) (plasmid pCG358). Then, the fragment was excised from the 
polylinker region of pCG358 with BamHI and Sph\ and cloned In place of a 0.7 kb 

5 Bgl\\/Sph\ fragment (containing a portion of the tetracycline resistance gene) of 
production plasmid pCG494. The final 14.7 kb plasmid pCG532 contains co/E1 
compatibility group origin of replication, the chloramphenicol resistance gene, the udk 
and the dcd genes and the T4 bacteriophage nrdCAB (encode thioredoxin and two 
subunits of ribonucleotide reductase, respectively) and the T4 td (encodes thymidylate 

10 synthase) genes under control of the Pl promoter of bacteriophage lambda. The T4 
nrdA gene of pGG532 was previously changed by site-directed mutagenesis C^AIa to 
lie). 

A synthetic transcriptional terminator is located downstream of the td gene to prevent 
15 transcriptional readthrough into the replication region. The genetic map of plasmid 
pCG532 is illustrated in Figure 3. 

EXAMPLE 9 

Shake flask fermentation of thymidine bv CMG2451 {pCG494> and 
20 CIWG2451(pCG532) 

Plasmid pCG532 containing the udk and the dcd genes of E. coli was introduced into 
production strain CMG2451 . New strain CMG2451 (pCG532) was tested together with 
parent strain CMG2451 (pCG494) in shake flask experiments to compare thymidine 
25 production. The results of two independent experiments are shown in Table 8. The first 
experiment was performed as described above and samples were taken at 1 7 hours 
after induction. In the second experiment cells were induced at higher OD (about 9) and 
samples were taken at 3 hours after induction for analysis. In both cases the strain 
containing the cloned udk and dcd genes performed better than the parent strain. 
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Table 8: Thymidine fermentation by thymidine by CMG2451 fpCG494^ and 
CMG2451 (PCG532) 



Experiment 1 








Strain 


O.D 600 


TdR (mg/L) 


Specific Activity(mg/l_yOD) 


CMG2451 (pCG494) 


16.5 


599 


36.3 


CMG2451 (pCG532) 


17.5 


867 


49.5 










Experiment 2 








Strain 








CMG2451 (pCG494) 


8.5 


149 


17.5 


CMG2451 (pCG532) 


8.4 


192 


22.8 



5 Example 10: Addition of una mutation and its effect on thymidine production in 
Shake flask fermentation 

An uracil DNA glycosylase negative strain was constructed by introducing an ungiTnIO 
(Varshney. U., eta!., J. Biol. Chem. 263:7776-7784, 1988) mutation into host CMG2451 

10 using PI transduction as described above, and was named CMG2492. Plasmid 

pCG366 was introduced into CMG2492. A comparison experiment between Ung" and 
Ung* strain for thymidine synthesis in shal<e flasks is shown in Figure 4 using the flask 
method described in Example 6. Cells were grown in 250ml flasks at 30°C, and 
thymidine synthesis was induced by shifting temperature to 37°C for 30 min, then, 

15 shifting to 35°C. The Ung- host without uracil DNA glycosylase kept growing longer and 
made 30% more thymidine. 

EXAi\flPLE 11 

Thymidine Production in a 30 Liter Fermentor with strain CimG2451 rpCG532) 

20 

The following conditions were used to produce thymidine In a 30-Liter fermentor (B. 
Braun Biotec Biostat C) with strain CMG2451/532. The seed culture (500 mL)was 
grown in a 4 Liter baffle shake flask in LB medium (5 q/L Difco yeast extract, 10 g/L 
Difco tryptone, 5 g/L NaCI) with 30 mg/L chloramphenicol and 25 mg/L kanamycin at 30 
25 ° C until 2.37 OD 600 nm was reached with a final pH of 6.69. 
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The initial batch in the fermentor (1 2 Liters) containing the composition listed in Table 9 
was sterilized at 121° C for 55 minutes. After cooling a separately autoclaved solution 
(500 mL) was added to adjust the batch to 20 g/L sorbitol and 3.0 qIL MgS04 7 HjO. 
Also added before inoculation was a sterile filtered solution (200 mL) designed to to 
5 adjust the initial batch to 30 mg/L chloramphenicol, 25 mg/L kanamycin, 1 mg/L d- 
biotin, 1 0 mg/L thiamine and 10 mg/L nicotinic acid. 

Three feed solutions were prepared: a) Cerelose 2001 (dextrose monohydrate) 562 g/L 
with 2 mg/L biotin, 20 mg/L thiamine, 20 mg/L nicotinic acid, and 30 mg/L 

10 chloramphenicol; b) sorbitol 717 g/L with with 4 mg/L biotin, 40 mg/L thiamine, 40 mg/L 
nicotinic acid, and 60 mg/L chloramphenicol; and c) crude nitrogen mixture containing 
360 g/L Amberex 695 AG (Red Star Yeast & Products, Milwaukee, Wise), 6 g/L PP90M 
(DMV International, Fraser, NY) with IX trace elements and 0.1 mL/L Mazu 
DF1 0PMOD1 1 (BASF). The feed solutions were sterilized for 40 to 50 minutes underl 8 

15 PSI steam pressure. 

Table 9 : Initial batch composition in 30 L fermentor 





Concentration 


Component 


(g/L ormUL) 


Sodium hexametaphosphate 


4 


KH2PO4 


4 


(NH,),HP04 


4 


CaCU 


0.4 


Citric acid 


0.5 


Amberex 695 (Red Star Yeast & Products, 
Milwaukee, Wl) 


20 


PP90M (DMV International, Fraser, NY) 


15 


Tryptone (Difco, Detroit, Ml) 


5 


1 0OOX Trace elements (see Example 7) 


1 mL/L 


CaCOa 


5 


Glycine 


1.83 


Mazu DF10PMOD11 Defoamer (BASF 
Speciality Products, Gurnee, IL) 


0.2 mL/L 



20 

The operating conditions were as follows: initial temperature 31 "C; RPM 600; air flow 3 
LPM; pH 6.8; pressure 0 Bar. The dissolved oxygen was controlled at 25% saturation 
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using air flow rate control loop. The RPM was increased to 750 RPM at 8 hours, 850 
RPM at 9 hours and 950 RPM at the time of temperature shift from 31 to 35.5°C at 9.2 
hours when the culture reached 37.8 OD. Beginning at 9.7 hours back pressure was 
applied up to a maximum of 0.6 Bar to aid in oxygen transfer into the culture. The rate 
of temperature shift for induction of thymidine synthesis was 0.2°C per minute. 

After the cell mass reached 20 OD at 600 nm (read after dilution in 50 mM H2S04to 
dissolve salts), 85 mL batch feeds of the sorbitol feed solution (b) were made for each 5 
OD increase in cell mass. At 16.7 hours sorbitol batch feeding was stopped and 
dextrose monohydrate (glucose) feed (a) was started under the control of the DO PID 
control loop of the B. Braun fermentor with a set point of 25%. Simply stated, when 
dissolved oxygen was below 25% sugar feed was off, and when DO was greater than 
25% the sugar feed was set to on. The protocol self regulates the glucose 
concentration keeping the concentration low, but does not allow the culture to starve for 
glucose for a very prolonged length of time. Crude nitrogen feeds (500 mL) were made 
at 1 1 hours, 16.2 hours, 21 .2 hours, 28.2 hours, 33.2 hours and 40.5 hours. 

The cell mass, thymidine and deoxyuridine accumulation during the fermentation are 
shown in Figure 5. 

EXAMPLE 12 
Purification of Thymidine from Fernfientation broth. 

Dowex Optipore L-285 (The Dow Chemical Company) was suspended in delonized 
water and packed into a 48mM diameter glass column making a bed volume of about 
500 mL. The column was washed with 500 mL of 5% NaOH, then washed with 
deionized water to until the effluent was pH 7.0. 

500 mL Fermentation broth with thymidine (TdR) concentration of 4.890 g/L and 
deoxyuridine (UdR) concentration of 1 .040 g/L was loaded onto the column. The 
column was washed with two bed volumes of deionized water. TdR and UdR were 
eluted by two bed volumes of 5% reagent alcohol (Ethanol 90.5%, Methanol 4.5%, 
Isopropyl alcohol 5%) followed by two bed volumes of 10% reagent alcohol and two 
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bed volumes of 15% reagent alcohol. The TdR and UdR in the 25 mL fractions is 
shown in Fig. 6. The column was regenerated by washing with 5% NaOH, then with 
deionized water to pH7.0 and the procedure was repeated. The fractions 91 -1 60 were 
pooled together from the two separate runs and dried using a rotary vacuum 
evaporator. Thymidine was re-dissolved in a minimal amount of hot water and 
crystallized at 4°C. Then the crystals were re-cfystallized two times and dried in a 55*'C 
oven for 1 5 hours. A total of 979.8 mg of crystalline thymidine was obtained with a 
purity of greater than 99% that should be suitable for use as a pharmaceutical 
intermediate. 

The side fractions containing both deoxyuridine and thymidine were pooled with mother 
liquors and reduced by a rotary vacuum evaporator to a final volume of 1 200 mL with 
reduced alcohol. The total amount of TdR in this 1200 mL solution was 3571 mg. The 
total recovery (including the 3571 mg of TdR side fractions) was 93.1%. 
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1 . A DNA construct comprising a transcriptional unit which comprises a 
ribonucleotide reductase gene and a thioredoxin gene or a uridine kinase gene 
and/or a dCTP deaminase gene. 

2. A DNA construct as claimed in claim 1 comprising a transcriptional unit which 
comprises a ribonucleotide reductase gene and a thioredoxin gene. 

3. A DNA construct as claimed in claim 1 comprising a transcriptional unit which 
comprises a uridine kinase gene and/or a dCTP deaminase gene. 

4. A DNA construct as claimed in claim 1 comprising a transcriptional unit which 
comprises a uridine kinase gene and a dCTP deaminase gene. 

5. A DNA construct according to any of claims 1-4 wherein the construct is an 
exlrachromosomal vector. 

6. A DNA construct according to claim 5 wherein the vector is a plasmid, virus, 
transposon, minichromosome or phage. 

7. A DNA construct according to claim 6 wherein the vector is a plasmid, wherein the 
ribonucleotide reductase gene is a nrdA gene and/or a nrdB gene, and wherein 
the thioredoxin gene is a nrdC gene and said genes are arranged in an operon. 

8. A DNA construct according to claim 7 wherein the transcriptional unit comprises a 
nrdA gene and a nrdB gene. 

9. A DNA construct according to claim 8 in which the nrdA and nrdB genes are 
located downstream of the nrdC gene. 

10. A DNA construct according to claim 8 wherein the nrdC gene is upstream of the 
nrdA gene and nrdA gene is upstream of the nrdB gene. 
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11. A DNA construct according to claim 6, further comprising a regulatory element. 

12. A DNA construct according to claim 1 1 , wherein the regulatory element is selected 
from the group consisting of a promoter, an operator, a termination sequence, an 
initiation sequence and a ribosome binding site. 

13. A DNA construct according to claim 1 2 wherein the promoter is the lambda Pl 
promoter or a derivative therefrom. 

14. A DNA construct according to claim 1 2 wherein the termination sequence is a 
synthetic terminator. 

15. A DNA construct according to claim 7 wherein the nrdfK gene is modified such that 
ribonucleotide reductase encoded by the unit is less sensitive to ailosteric 
inhibition than the ribonucleotide reductase encoded by the unit comprising an 
unmodified nrd/K gene. 

16. A DNA construct according to claim 1 5, wherein the nrdA gene is modified at a 
dTTP binding site. 

17. A DNA construct according to claim 1 5 wherein the nrdfK gene is modified so as to 
encode an Ala-to-lle change at position 79 in the nrd/K expression product. 

18. A DNA construct according to claim 7 wherein the nrdK nrdB and nrdC genes are 
derived from a T phage. 

19. A DNA construct according to claim 1 8 wherein the T phage is a T even phage. 

20. A DNA construct according to claim 1 9 wherein the T even phage is a T4 phage. 

21 . A DNA construct according to any of claimsl -4 wherein the construct further 
comprises a thymidylate synthase gene. 
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22. A DNA construct according to claim 21 wherein the thypriidylate synthase gene is 
the td gene. 

23. A DNA construct according to claim 22 wherein td gene is located on the same 
vector as the nrdA, nrdB and nrdC genes. 

24. A DNA construct according to claim 22 wherein the td gene is located in the same 
operon as the nrdATnrdB and nrdC genes. 

25. A DNA construct according to claim 24 wherein the td gene is located downstream 
(in terms of reading frame) from the nrdA, nrdB and nrdC genes. 

26. A DNA construct according to claim 22 wherein the td gene is derived from a T 
phage. 

27. A DNA construct according to claim 26 wherein the td gene Is derived from a T 
even phage. 

28. A DNA construct according to claim 27 wherein the td gene is derived from T4 
phage. 

29. A DNA construct according to claim 2 further comprising a uridine kinase gene 
and/or a dCTP deaminase gene. 

30. A DNA construct according to claim 29, wherein the DNA construct comprises 
both a uridine kinase gene and a dCTP deaminase gene. 

31 . A DNA construct according to claim 1 .3 or 4 wherein the uridine kinase gene is a 
udk gene. 

32. A DNA construct according to claim 1 ,3 or 4 wherein the dCTP deaminase gene is 
a dcd gene 
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33. A modified host cell comprising a DNA construct according to any of the preceding 
claims. 

34. A modified host cell according to claim SSwherein the host cell displays repressed 
or no uracil DNA glycosylase activity. 

35. A modified host cell according to claim 34 wherein one or more host cell DNA 
sequences encoding for uracil DNA glycosylase activity have been modified so as 
to encode expression products displaying repressed, low levels of or no uracil 
DNA glycosylase activity. 

36. A modified host cell according to claim 35 wherein the host cell DNA sequence is 
an ung gene. 

37. A modified host cell according to claim 34 wherein one or more host cell DNA 
sequences encoding for uracil DNA glycosylase activity have been removed. 

38. A modified host cell according to claim 33 wherein the cell is a eukaryote or a 
prokaryote. 

39. A modified host cell according to claim 38 wherein the host cell is selected from 
the group consisting of E. coU, Salmonella, Pseudomonas, Bacillus and 
Saccharomyces. 

40. A modified host cell comprising a DNA construct, which construct comprises a 
transcription DNA unit, which unit comprises a ribonucleotide reductase gene and 
a thioredoxin gene, wherein said ribonucleotide reductase displays less sensitivity 
to allosteric inhibition than the wild-type equivalent of the reductase, and wherein 
said host cell further comprises one or more of the following features: 

(a) a transcription unit located on said DNA construct, comprising a thymidylate 
synthase gene heterologous to the thymidylate synthase gene of the host 
cell; 
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5 



(b) 



(c) 



(d) 



a transcription unit located on said DNA construct, comprising a uridine 

kinase gene; 

a transcription unit located on said DNA construct, comprising a dCTP 
deaminase gene; and 

repressed or absent uracil DNA glycosylase activity. 



41 . A modified host cell according to claim 40, wherein the ribonucleotide reductase 
gene is modified at a dTTP binding site. 

10 42. A modified host cell according to claim 41 , wherein the ribonucleotide reductase is 
the nrdA gene and the modification is an Ala-to-lle change at position 79 in the 
nrdA expression product. 

43. A modified host cell according to claim 38, wherein the DNA construct comprises 
15 both the uridine kinase gene and the dCTP deaminase gene. 

44. A modified host cell according to claim 40, wherein the cell comprises each one of 
the features of (a) to (d). 

20 45. A process for the production of pyrimidine deoxyrlbonucleosides comprising 
culturing a host cell according to any of claims 33-44. 

46. A process according to claim 45 wherein the deoxyribonucleoside Is thymidine. 

25 47. A process for the manufacture of azidothymidine comprising culturing a host cell 
according to any of claims 33-44. 

48. A culture medium comprising a host cell according to any of claims 33-44. 
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SEQUENCE LISTING 

<1 10> Glaxo Group Limited 
Anderson, David M 
Liu, Lin 

5 Podl<ovyrov, Sergey 
Wang, Baomin 

<1 20> Vectors, cells and processes for pyrimidine 
deoxyribonucleosides production 

10 

<130> P1049 

<140> 
<141> 

15 

<150> US 09/345492 
<151> 1999-07-01 

<150> US 60/141827 
20 <1 51 > 1999-07-01 

<160>9 

<170> Patentin Ver. 2.1 

25 

<210> 1 
<211>34 
<212> DNA 

<213> Artificial Sequence 

30 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 1 

35 tattctagac gattttcaag ttgaggactt atgc 34 



<210>2 
<211>35 
40 <212>DNA 

<21 3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

45 <400> 2 

tatatcgata attcattaca atttacacgc tgcac 



<210>3 



1 

SUBSTITUTE SHEET (RULE 26) 



wo 01/02580 PCT/GBOq/02357 

<211>35 
<212> DNA 

<213> Artificial Sequence 
5 <220> 

<223> Description of Artificial Sequence: Primer a, 

<400>3 . . _ . . 

tatatcgata aatgtaaatt taaggattct aaatg 35 

10 

<210>4 
<211>34 
<21 2> DNA 

<213> Artificial Sequence 

15 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 4 

20 tatgtcgact ccttaaaagt attttttaaa actc 34 

<210> 5 
<211> 30 
<212> DNA 
25 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA insert 
30 <220> 

<223> Part of a double stranded DNA insert. The 
non-overhanging part of SEQ ID NO: 6 is the 
complement 

35 <400> 5 

cgagcccgcc taatgagcgg gctttttttt 30 

<210>6 
<211> 38 
40 <212> DNA 

<21 3> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA Insert 

45 

<220> 

. <223> Part of a double stranded DNA insert. SEQ ID NO: 5 
is the complement of the non-overhanging part of 
the sequence 
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<220> 

<221 > misc_feature 
<222> (1)..(4) 
5 <223> Overhang 

<220> 

<221> nnisc_feature - . . . . . 

<222> (35). .(38) 
10 <223> Overhang 

<400> 6 

gtacaaaaaa aagcccgctc attaggcggg ctcgggcc 38 

15 <210>7 
<211>37 
<212> DNA 

<21 3> Artificial Sequence 

20 <220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 7 

25 gatccggagg ataaatgaaa caataccaag atttaat 37 

<210>8 
<211>31 
<212> DNA 
30 <21 3> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

35 

<400> 8 

taaatcttgg tattgtttca tttatcctcc g 31 



40 <210>9 
<211> 45 
<212> DNA 

<21 3> Artificial Sequence 
45 <220> 

<223> Description of Artificial Sequence: Primer 
<400> 9 

agcaaacatt aaacagcgtg caattacata tlgataatca ggttc 45 
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